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FIELD OF THE INVENTION . 

; ; ■ ™* , i 

i 1 , 

, This invention relates to newly isolated nucleic acid molecules and their uses. The 

nucleic acid molecules are shown to be upregulated by the cytokine interleukin-9 ("IL-9"). 
10 Also disclosed are the proteins encoded thereby. They are described as T Cell Derived 

Inducible Factors ("TIFs"). These nucleic acid molecules encode proteins which induce STAT 
activation in cells. They can be used, for example, in the stimulation of regeneration of 
targeted tissues . Further, their inhibitors or antagonists can be used to retard, prevent or 
inhibit differentiation of other tissues. 

15 BACKGROUND AND PRIOR ART 

The last decade has seen knowledge of the immune system and its regulation expand 
tremendously. One area of particular interest has been that of research on the proteins and 
glycoproteins which regulate the immune system. One of the best known families of these 
molecules are the cytokines. These are molecules which are involved in the "communication" 

20 of cells with each other. The individual members of the cytokine family have been found to 

be involved in a wide variety of pathological conditions, such as cancer and allergies. 
Whereas sometimes the cytokines are involved in the pathology of the condition, they are also 
known as being therapeutically useful. 

Interleukins are one type of cytokine. The literature on interleukins is vast. An 

25 exemplary, but by no means exhaustive listing of the patents in this area includes U.S. Patent 

No. 4,778,879 to Mertelsmann et al.; U.S. Patent No. 4,490,289 to Stern; U.S. Patent No. 
4,518,584 to Mark et al. ; and U.S. Patent No. 4,851,512 to Miyaji et al., all of which involve 
interleukin-2 or "IL-2. " Additional patents have issued which relate to interleukin-1 ("IL-l"), 
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such as U.S. Patent No. 4,808,611 to Gosman. The disclosure of all of these patents are 
incorporated by reference herein. More recent patents on different interleukins include U.S. 

Patent Nos: 5,694,234^ S^bb^^'s^TljgS and 5,215,895 (IL- 

. i * ' 1 it.." 1, ' ( ■ 

11); 5,728,377, 5,716,251,-5,328,989 (IL^O); 5,580,753, 5,587,3021 5,157,112, 5,208,218 

5 (IL-9); 5,194,375, 4,965,195 (IL-7); 5,723,120, 5,178,856 (IL-6), and 5,017,691 (IL-4). 

' . . 1 '''' ,' v ' 

Even a cursory review of this patent literature shows the diversity of the properties of the 

members of the interleukiri family. One can assume that the larger cytokine family shows 

even more diversity. 'See, e.g. , Aggarwal et al. , ed. ,' Human Cytokines: Handbook For Basic 

And Clinical Research (Blackwell Scientific Plications, 1992) v Paul, ed., Fundamental 

10 Immunology (Raven Press,! 1993), pg 763-836, "T-Cell Derived Cytokines And Their 1 

Receptors", and "Proinflammatory Cytokines and Immunity." All cited references are 
incorporated by reference. ' , , 

The relationships between various cytokines 1 are complex. As will be seen from the 
references citbd herein, as the level of a particular cytokine increases or decreases, this can 

15 affect the levels of other molecules produced by a subject; either 'directly or indirectly. 

Among the affected molecules are other cytokines. 

The lymphokine IL-9, previously referred to as "P40," is a T-cell derived molecule 
which was originally identified as a factor which sustained permanent antigen independent 
growth of T4 cell lines. See, e.g., Uyttenhove et al., Proc. Natl. Acad. Sci. 85: 6934 (1988), 

20 and Van Snick et al. , J. Exp. Med. 169 : 363 (1989), the disclosures of which are incorporated 

by reference, as is that of Simpson et al., Eur. J. Biochem. 183: 715 (1989). 

The activity of IL-9 was at first observed on restricted T4 cell lines, failing to show 
activity on CTLs or freshly isolated T cells. See, e.g., Uyttenhove et al .. supra , and Schmitt 
et al., Eur. J. Immunol. 19: 2167 (1989). This range of activity was expanded when 

25 experiments showed that IL-9 and the molecule referred to as T cell growth Factor III ("TCGF 

III") are identical to MEA (Mast Cell Growth Enhancing Activity), a factor which potentiates 
the proliferative response of bone marrow derived mast cells to IL-3, as is described by 
Hultner et al., Eur. J. Immunol, and in U.S. patent application Serial Number 498,182 filed 
March 23, 1990, the disclosures of both being incorporated by reference herein. It was also 

30 found that the human form of IL-9 stimulates proliferation of megakaryoblastic leukemia. See 

Yang et al., Blood 74: 1880 (1989). Recent work on IL-9 has shown that it also supports 

2 
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erythroid colony formation (Donahue et al:, Blood 75(12): 227 1-2275 (6-15-90)); promotes 
the proliferation of myeloid erythroid burst formation (Williams et al. , Blood 76: 306 : 31 1 (9- 
1-90); and supports clonal maturation of BFU-E's of adult and fetal origin (Holbrook et al., 
Blood 77(10): 2129-2134 (5-15-91)). Expression of IL-9 has also been' implicated in 
5 Hodgkins's disease and large cell anaplastic lymphoma (Merz et al. , Blood 78(8): 131 1-1317 

(9-l-90) r Genetic analyses of mice that were susceptible or resistant to the: development of 
bronchial hyperresponsiveness have unraveled a linkage with the, IL T 9 gene /as well as a , i 
correlation between IL-9 production and susceptibility in this model ( Nicolaides et ah. Proc. 
Natl. Acad. Sci. USA, 94, 13175-13180, 1997). Human genetic studies also point to the IL-9 

10 i and IL-9R genes as candidates for asthma ( Doull et al.. Am. J. Respir. Grit. Care Med., 153, 

1280-1284, 1996; Holrovd et al. . Genomics 52, 233-235, 1998). Secondly, IL-9 transgenic ( 
mice allowed for the demonstration that increased IL-9 expression result in lung mastocytosis, s 
hypereosinophilia, bronchial hyperresponsiveness and high levels of IgE ( Temann et al. J J. 
Exp. Med. 188, 1307-1320, 1998; Godfraind et al. . J. Immunol. 160, 3989-3996, 1998; 1 

15 McLane et al. . Am. J. Resp. Cell. Mol. 19:713-720 (1999). Taken together, these 

observations strongly suggest that IL-9 plays a major role in this disease Additional work has 
implicated IL-9 and muteins of this cytokine in asthma and allergies. See, e.g. PCT 
Application US96/12757 (Levitt, et al), and PCT US97/21992 (Levitt, et al), both of which 
are incorporated by reference.. 

20 IL-9 is known to affect the levels of other molecules in subjects. See Louahed et al. . 

J. Immunol. 154: 5061-5070 (1995; Demoulin etal. . Mol. Cell. Biol. 16: 4710-4716 (1996), 
both incorporated by reference. It will be recognized that the molecules affected have their 
own functions in biological systems. For example, Demoulin et al. show that many of the 
known activities of IL-9 are mediated by activation of STAT transcription factors. As such, 

25 there is continued interest in trying to identify molecules whose presence and/or level is 

affected by other molecules, such as cytokines. 

The disclosure which follows describes such molecules. It was found that nucleic acid 
molecules encoding the proteins of the invention were expressed in the presence of IL-9, but 
not in its absence. Hence, these molecules are, inter alia, "markers" for the expression or 

30 effect of IL-9 in a subject. The molecules are referred to as T Cell Derived Inducible Factors 

3 
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or "TIFs" hereafter. These and other features of the invention will be seen in the disclosure 

, ''. ' , 1 ( 1 , 'i , 1 

which follows. , , , , i , . 

1 , ' i ■ ' r| ' 1 ,' - i' 

1 l^'fc t I ( | , |' lj l! i' , .,|'| ,t ( ( ' i ' i I i | I '' ' i l i ■ ,| 

BRIEF DESCRIPTION OF THE FIGURE , , 

Figure 1 compares deduced amino acid sequences of murine and human TIF (SEQ ID 
5 NOS: 27 and 28, respectively). , ; , i ! ' ] ( '< , 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Example 1 i , 

The murine lymphoma cell line BW5 147 is well knovvh as a cell line which can be 1 
grown in vitro , withput the need to add any cytokines to its culture medium. In order to 
10 identify genes induced by IL-9, samples of BW5147 were cultured either with (200 U/ml), or 

without IL-9, for 24 hours. ■ Then, total RNA was isolated, using guanidium isothiocyanate 
lysis, and CsCl gradient centrifugation. These techniques are well known in the art. 
Following this, polyadenylated RNA was, purified from the total RNA, by using an oligo(dT) 
cellulose column. The isolated, poly A RNA was then used to generate double stranded 
15 cDNA. A commercially available oligo(dT) primer was used. Anywhere from 3-5 ug of 

poly A RNA were heated to 70 °C for 10 minutes with 1/xg of oligo dT, and then incubated 
with 5x first strand buffer (250 mM HC1 (pH 8.3), 375 mM KC1, 15 mM MgCy, 10 mM 
dithiothreitol, 500 uM of deoxynucleotide triphosphates, and 800 U of reverse transcriptase. 
Total volume of the reaction mixture was 20 ul, and the reaction was allowed to proceed at 
20 37°C for one hour. This resulted in synthesis of the first stand of cDNA. Second strand 

synthesis was accomplished by adding 30ul of 5 second strand buffer (lOOmM Tris-HCl (pH 
6.9)), 450mM KC1, 23mM MgCl 2 , 0.75mM p-NAD + , 50mM (NH 4 ) 2 S0 4 , together with 60U 
of E. coli derived DNA polymerase I, 2U of E. coli RNase H, 10 U of E. coli DNA ligase, 
and 250 uM of deoxynucleotide triphosphates, and brought to a final volume of 150 ul. The 
25 mixture was incubated for two hours, at 16°C. 

The product was extracted using phenol-chloroform, and was precipitated with ethanol. 
The final cDNA product was then resuspended in 200 yul of TE. 

These steps were carried out for both the stimulated BW5147 cells ("tester" hereafter), 
and for parallel, unstimulated BW5147 cells ("driver" hereafter). 

4 
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Example 2 i . * ' ■ , 

The cDNA prepared in Example 1, was then subjected to subtraction cloning in 

i ( K - ' 1 ■ i' ii «' ,im . , i ! . t . 1 i ,| 

1 1 i ,i 

accordance with well known methods. To do this,,six oligonucleotides were prepared: 
5 -AGCACTCTCC'AGCCTCTCXc CGCA^3 (SEQ ID' NO: lj; , 

i ' 1 1 • 

• ( ' - 1 1' 1 

5 5'-GATCTGCGGT GA-3' (SEQ ID NO: 2); , ,> • 1 

5'-ACCGACGTCG ACTATCCATG AACA-3' (SEQ ID NO: 3); 
5' GATCTGTTCA TG-3' (SEQ ID NO: 4); i, ,, ' 1 

. ■ . ' ■ '" '■■ ' 

5 ' -AGGC A ACTGT GCTATCCGAG GGAA-3' (SEQ ID NO: 5); and 

10 , 
S^GATCTTCGCT CG-3* (SEQ ID NO: 6). 

i 

These were used as explained herein. Double stranded cDNA (2 ug), was digested with 
restriction endonuclease DpnII, extracted with phenol-chloroform, precipitated with ethanol, 
and resuspended in 20 ul of TE (lOmM Tris-HCl (pH 7.5); ImM EDTA). Twelve ul (1.2 

15 ug), of cut cDNA was ligated to double stranded SEQ ID NOS: 1 and 2, in a mixture which 

included 4 ul of desalted SEQ ID NO: 1 (2mg/ml), 4 ul desalted SEQ ID NO: 2 (1 mg/ml), 
10/xl of 5X adapter buffer (330mM Tris-HCl, pH 7.6, 50mM MgCl 2 , 5mM ATP), 7^1 DTT 
(lOOmM), and 28/tl of H 2 0). The oligonucleotides were annealed to each other and to the 
sample DNA by heating the mixture to 50°C and then cooling it to 10°C over one hour, 

20 followed by adding 5ul of T4 DNA ligase, and incubation for 12-14 hours, at 12-16°C. The 

mixtures were diluted by adding 140 ul of TE. PCR was then carried out on 200 ul samples, 
as described infra. 

Example 3 

To carry out PCR, 200 ul samples containing 2 ul of the ligation product in a buffer 
25 of 66 mM Tris-HCl, pH 8.8, 4 mM MgCl 2 , 16 mM (NH 4 ) 2 S0 4 , 33 ug/ml BSA, 0.3 raM of 

each dNTP (concentration: 500 /zM), and 2 ug of SEQ ID NO: 2 were first heated at 72°C 

5 
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' i 1 

for three minutes to remove any of SEQ ID NO: 1 v^hich was hybridized to the product of 
Example 2. The 3' ends were then filled in by using 5U of Taq polymerase (5 minutes, 
72 ° C) . Twenty cycles of amplification w erh carried but ( l 1 cycle : l 1 minute at 95 ° C , arid three 
minutes at 72°C), after which products were combined, phenol extracted, ethanol precipitated, 
5 and resuspended in TE buffer, at a concentration of 0.5 ug/ul. Hereinafter, this is referred 

to as the representation. ^ . ', ■ '* ' , . 1 'i ' ; . ',' ' ' "'i' ■ 

Example 4 .' 1 ' \ : ( ' 

' The representation was then prepared for subtracitivp hybridization by removing SEQ 
ID I^O: 1 therefrom by digestion with Dpn II. The resulting digest was phenol extracted and 

10 ethanol precipitated. In the case of the unstimulated sample, this resulted in the driver, while 

the stimulated sample resulted in the tester! Portions of tester (20 ug) were gel purified on 
a 1.2% agarose gel and isolated. Samples (2 ug), were ligated to SEQ ID NOS: 3 and 4, in 
the same way that SEQ ID NOS: 1 and 2 were ligated, as described, supra. 

In a first cycle of subtractive hybridization, 0.4 ug samples of tester with SEQ ID 

15 NOS: 3 and 4 ligated thereto were mixed with 40 ug of driver cDNA. The mixture was 

phenol extracted, ethanol precipitated, dissolved in 2 ul of 3XEE buffer (30mM EPPS pH 
8.0), 3mM EDTA; pH 8.0, 3 mM EDTA. This was overlaid with 30 ul of mineral oil, and 
denatured for five minutes at 98 °C. A 5M NaCl solution (0.5 ul) was added, and DNA was 
hybridized for 20 hours, at 67 °C. The reaction mixture was diluted to 200 ul with TE, and 

20 tRNA carrier. The samples were incubated for three minutes at 72 °C to melt away SEQ ID 

NO: 4, and then four PCR reactions (200 ul) were prepared. These included 20 ul of diluted 
hybridization mix without primer, to fill in the ends of the reannealed tester, followed by 10 
cycles of amplification after adding samples of SEQ ID NO: 3 (1 cycle: lminute at 95°C, three 
minutes at 70 °C) after which products were combined, phenol extracted, ethanol precipitated, 

25 and resuspended in 40 jd of 0.2XTE buffer. Single stranded DNA was degraded by a 30 

minute treatment of 20 /xl of this material with 20U of mung bean nuclease, at a total volume 
of 40 ul. Samples was diluted (1:5), in 50 mM Tris-HCl, at pH 8.9, followed by five minutes 
of heating at 98°C to inactivate the enzyme. A second PCR was carried out, using 20 ul of 
the product described supra , 2 ul of SEQ ID NO: 3 (1 mg/ml), and 1 ul (5 U) of Taq DNA 

30 polymerase. A total of 18 cycles (1 cycle: 1 minute at 95 °C, three minutes at 70°C) were 

6 
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carried out. Products were combined, phenol extracted ■ ethanol precipitated, and resuspended 
at 0 5-1 ue/ul. The product is referred to hereafter as "DPI", or the first difference product. 

1 • r. , . 1 . ' • 

i 1,1 , . . < , * f 

Example 5 

DPI was then digested with endonuclease DpnII, as described above, and was ligated 

i ' •! ' I , ' ' M f. . ( , * 

5 to SEQ ID NOS: 5 and 6, following the same processes described for SEQ ID NOS: !, 2, 3 

^■i , i • -'i* ' ■ • 

and 4. Subtractive hybridization and selective amplification, as described in example 4, was 
repeated, and second difference product, or "DP2", was generated. In these experiments, 50 
ne of DPI was the tester. The driver (40 ug), was as described supra . The process was 
repeated to generate a third difference product, using SEQ ID NOS: 3 and 4 as adapters. To 
10 generate the third product, 100 pg of tester were mixed with 40 /xg of driver. All steps of the 

protocols supra were repeated, except the final amplification was carried out for 22 cycles, 
where one cycle was one minute at 95 °C, and three minutes at 70°C. This yielded the final 
difference product. i , 

Example 6 

15 The final difference products were digested with DpnII, and then cloned into the 

BamHI site of a commercially available vector, i.e. , ptZ19R. Double stranded DNA plasmids 
were prepared, and then sequenced, using standard methods. The sequences were compared 
to known sequences in the GenBank and EMBL data bases, using a BLAST search program. 
At the end of this subtraction procedure, a short cDNA fragment was identified, i.e., 

20 a fragment about 200 base pairs long. This fragment was used to screen a cDNA library from 

BW 5147 cells. The largest clone was sequenced. It is discussed infra. It does not 
correspond to any known sequence. 

The nucelotide sequence (SEQ ID NO: 7), is 1121 bases long, including a 537 base 
pair open reading frame, which encodes a protein 179 amino acids long. The predicted 

25 molecular weight of the protein is 20,093. There are two additional ATG codons which, if 

they acted as start codons, would produce proteins 172 and 167 amino acids in length, with 
molecular weights of 19,335 and 18,770 daltons, respectively. Each form of the protein is 
characterized by a sequence of hydrophobic amino acids which would be cleaved off of the 
molecule via the endoplasmic reticulum to provide a mature protein. 

7 
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Analysis of the sequence shows three AT rich motifs (TTATTTAT). These motifs are 
often found in 5' - untranslated regions of cytokines and oncogenes. Kruvs. et al. . Science 
245 : 852 (1989), have 1 shown that thfese tepeats' modulate stability of rhRNA fbr TIF., 

' j ' ■ " ' ' 1 1 1 1 ■ " ,h ; 1 ■ ,'' ' ■ 1 

' 1 1 ■ ' ' ' 1 1 1 '. ," V' ' ' /, ' > ! 

Example 7 

5 ThdcDNA isolated and analyzed in example '6, isupra i was then used as a probe to 

■ ' , ' I I ^ 

identify genomic DNA for TIFa. i l ( , 

A genomic library prepared from mouse strain 129 v^as. screened ,with SEQ ID NO: 7, 
following standard methods. An EcoRI fragment from a positive clone was subcloned into 
plasmid pZERO and partially sequenced. The partial sequence is presented as SEQ ID 1 NO: 

1 ■ i 
Example 8 1 ( ! 1 

A second EcoRI fragment from the positive clone described in Example 7, supra , was 
also subcloned. There was a great deal of homology, but the sequences were not identical. 
To be specific, intron 1 of this sequence was 98% identical to SEQ ID NO: 8, intron 2 was 
15 100% identical and intron 3 was 92% identical. 1 

What is striking about the sequences is that the promoters are not at all homologous, 
suggesting independent regulation. The 5* untranslated regions are 92% identical. The first 
exon for TIFa is split into exon la and exon lp. The first coding exon (which is exon lb for 
TIFa and exon 1 for TIFp) are 99.5% identical, while the second exons are 100% identical, 
20 the third exons 97% identical, the fourth exons 98.5% identical, and 96% for the fifth exon. 

In the untranslated 3 1 - region, homology is 96 % . 

Example 9 

Using the information described in example 8, supra , a cDNA sequence for the second 
clone, designated TIFp was deduced, and is set forth as SEQ ID NO: 9. The genomic DNA 
25 sequence was also ascertained, in the same manner as is described, supra , and is set forth as 

SEQ ID NO: 29. 

As compared to the coding region for TIFa, that of TIFp has six silent changes. There 
are two changes which result in an inconsequential amino acid change (at both of positions 36 

8 
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and 113, Val in TIFa becomes He in TIFp). There is, also a more significant change, at 

position 112, where Gin becomes Arg. ' , , , 4 ^ t ( 

• ' i ' 1 . , ' ,i 

■ > i t * * 1 1 

» ! ! 1 I ■ ■ 

Example 10 1 ' ' i ' \ ' ' ' ' 

Experiments were undertaken to study expression of the TIFs. BW 5147 cells were 

! . ' . » r *, ■ 

5 stimulated with recombinant murine IL-9 (200U/ml), for, varying periods of time (0.2, 0.5, 

1, 2 & 24 hours). Total RNA was then isolated, using standard methods and reagents. 

Reverse transcriptipn was then carried out, using 5/xg total RNA and' an oligo (dT) primer. 

i i 1 

Samples of cDNA corresponding to 20ng of total RNA were then amplified for 25 cycles 
using different primers. (One cycle was 4 minutes at 94^ °C, 1 minute at 57°C, and 2 minutes 
10 at 72°C). The TIF primers were: ' ' 

5'-CTGCCTGCTT CTCATTGCCC T-3' (SEQ ID NO: 10) 
and 

5-CAAGTCTACC TCTGGTCTCA T-3 f (SEQ ID NO: 11) 

1 

(sense and antisense, respectively). 
15 These correspond to nucleotides 106-126, and 764-784 of SEQ ID NO: 7, respectively. 

As a control, p-actin was amplified as well, for 18 cycles (first cycle: 4 minutes at 94°C, 1 

minute at 60°C, 2 minutes at 72°C. Succeeding cycles were 1 minute at 94°C, 1 minute at 

60°C, 2 minutes at 72°C). 

Following amplification, post PCR products were analyzed on a 1 % agarose gel, and 
20 specific amplification was confirmed, following blotting, using internal radioactive probes. 

The probe for TIF was: 

5 ! -G ACGC AAGC A TTTCTC AGAG-3 1 (SEQ ID NO: 12) 

the conditions and probes set forth were not specific for one or the other of the forms of TIF; 
however, the amplification product of TIFa contains a Kpnl restriction site, while the 
25 restriction site for TIFP does not. Digestion of the amplification products with Kpnl indicated 

that most, if not all, of the TIF mRNA induced by IL-9 was TIFa, suggesting that the TIFa 
expression was induced rapidly via the IL-9. The mRNA for TIFa was detectable after 30 
minutes of stimulation, and reached a plateau over a 1-24 hour time period. 

Example 11 

9 
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Experiments were then carried out which showed that the induction of TIF mRNA by 
IL-9, described supra , does not require protein synthesis. In thesfe, experiments, total RNA 
was extracted from cells stimulated for 24 hours, as described irti example 10; but with or 
without 10/ig/mlof a protein synthesis inhibitor, cycloheximidje, for 4.5 hours ^ In a parallel 
5 set of experiments, cells were not stimulated. The total RNA was extracted, and RT-PCR 

amplification was carried out as described in example 10.' Post : PC;R products were analyzed 
, on an ethidium bromide-stained, j 1 % agarose gel. What was seen, was thiat the induction by 
IL-9 still occurred when protein synthesis was blocked. Hence, the effect olf IL-9 is a direct 

effect, not requiring the synthesis of a protein mediator. , ■ 1 ' < 

f ' ' i 1 1 i i 1 ' 1 

10 Example 12 ' ' ' ( ' 1 '. ',' 1 ' n 1 \ < 

( In these experiments, the role of STAT proteins in induction of TIF mRNA was 

. studied on derivatives of the cell line BW5147. The First line, BWh9R, expresses wild type 

human IL-9 receptors. The line BW-Phell6 is a transfectant with a single mutation (at 

i 1 i , 1 

position 116), which renders the receptor unable to activate STAT transcription factors. Still 

15 another cell line, BW-mut6, has a mutation which renders the receptor unable to activate 

STATS i while retaining the ability to activate STAT1 and STAT3. Finally, cell line BW-mut7 

has a single mutation which renders the IL-9 receptor unable to activate STAT1 and STAT3, 

but which retains the ability to activate STATS. 

Cell stimulation, isolation of total RNA, reverse transcription and amplification of 

20 cDNA were all carried out as described in example 10 (Cells were stimulated for 24 hours. 

i 

Both human and murine recombinant IL-9 were used). The PCR products were analyzed on 
an ethidium bromide stained, 1 % agarose gel, as describe supra . 

The analysis revealed that human IL-9 did not induce expression in BW-Phell6, 
suggesting that STAT transcription factors are implicated. It was found that IL-9 induced TIF 
25 expression in the BW-mut6 mutant, but not the mut7 variant, suggesting that STAT1 or 

STAT3 are involved, but not STATS. 

Example 13 

The expression of TIF mRNA in normal mouse spleen cells was then studied. 
Spleen cells from 10-12 week old Balb/c mice were cultured for 24 hours in control 
30 medium or the control medium supplemented with 20jig/ml of LPS (which activates B 

10 
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lymphocytes and macrophages), or ConA (which activates T cells), or ConA plus 1% of a 
blocking antiserum against murine IL-9, with p actin being used as a control. Purification of 
RNA, RT-PCR analysis were carried but 'as described suera. 

The data indicated' that TIF is, at best, very weakly expressed in resting spleen cells, 
5 not induced by LPS, but strongly induced by ConA r Anti IL-9 antiserum did not affect 

induction by ConA, suggesting that its effect is not mediated by IL-9, or is mediated by other 

cytokines. , , . , 1 i 

When the ConA activated spleen cells were analyzed using sequences of RT-PCR 

i i ' ■ 
products, it was found that these cells were expressing TIFa predominantly, or exclusively. 

i , 1 
' • << ' 

10 Example 14 ' ' 

Further experiments showed that TIF mRNA was expressed even in the absence of IL- 

9 induction. 

Spleen cells from 5 week old FVB mice were' enriched for T cells, using a nylon wool 
column. Then, the cells were stimulated for 24 hours in meciium supplemented with ConA 
15 (a T cell activator), or PMA (which activates PKC in most cells), either with or without IL-9. 

Total RNA was isolated using standard techniques, and then ten microgram samples 
were fractionated via electrophoresis on a 1.3% agarose gel containing 2.2M formaldehyde. 
The fractions were then transferred to a nitrocellulose membrane, labeled, and assayed in a 
hybridization assay following Van Snick, et al, J. Exp. Med. 169: 363 (1989), incorporated 
20 by reference. 

The results indicated that the induction of TIF by ConA was not modified, and that IL- 
9 did not induce TIF RNA in PMA activated spleen cells. 

Example 15 

The expression of TIF mRNA in various cell lines was tested. In these experiments, 
25 murine cell lines were stimulated for at least one day, with a particular cytokine. Specifically, 

9T7 is a T cell lymphoma, which responds to IL-2, IL-4 or IL-9. Cell lines TS3 and TS6 are 
derived from T helper cell clones, and proliferate in the presence of either IL-2 or IL-9. MC9 
and LI38 are mast cell lines, which proliferate in the presence of either IL-3 or IL-9. 
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Following stimulation, total RNA was prepared using standard guanidium 
isothiocyanate lyses, and CsCl gradient centrifugation. , ' ' ( ' 

The 9T7 line was then analyzed ( by ^Northern blotting, as Ascribed in example 14', 
while the other lines were assayed using RT-PCR analysis^ as described supra. " ( [ , 
5 It was found that IL-9 upregulated TIF expression in T helper cells and mast cells, 

while IL-2 and IL-3 did not. the 9T7 cell line, however, shbwed roughly the same level of 

i . r , ■ ■ , , . .» • ■ 

expression, regardless of the cytokine, indicating that 1 ltl-9 is not mandatory for TIF 

1 1 ■ 1 , • 1 1 ' , 1 ,,v ' ' ,,, I-./ 

expression. 1 , , ! , 1 

Example 16 ' ' , 1 1 • 1 n 1 ' ■< 

10 j The expression of TIF mRNA in B cell lines was then studied. The cell lines A2b, 

70Z/3, and BCL-1 are B cell leukemia cell lines which grow, in vitro , without cytokines. 

These cells were stimulated for 24 hours with IL-4 and IL-9 and total RNA was isolated, 1 using 

i' i , 1 

standard methods. Expression was analyzed by RT-PCR which was carried out for 35 cycles, 

followed by blotting and hybridization, as described supra . 1 

15 t The results indicated that TIF expression is detectable in B cells, but is weakly 

upregulated at best in the presence of IL-9 and IL-4. 

Example 17 

Experiments were then carried out to study expression of the inventive molecules in 
T helper cell lines. TS2 and TS1 are known T helper cell lines, derived from T helper cell 
20 clones, which proliferate in the presence of either IL-9 or IL-2 (TS2), and either IL-9 or IL-4 

(TS1). Specifically, TS1 or TS2 cells were grown in the presence of the listed cytokines for 
at least 10 days, after which RNA was extracted using known methods. Expression of the 
molecules was studied via RT-PCR (35 cycles), using the protocols described supra . In TS1 
cells both IL-4 and IL-9 induce TIF expression, but IL-2 does not do so in TS2 cells. 

25 Example 18 

Expression of TIF mRNA in various mouse organs were studied. Total RNA was 
prepared from liver, kidney, heart, brain, intestine, spleen, thymus, lung, muscle and bone 
marrow, using standard guanidium isothiocyanate methodologies and CsCl gradient 

12 
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centrifugation. Forty cycles of RT-PCR were carried out, using the protocols described supra. 

Strongest expression was found in thymus tissue, while less intense signals were found in brain 

... . i," i 1 * ' 

tissue, and weaker expression in the remaining tissues: i-<, 



Example 19 ' .' u . * " 1 ' 

5 | . j The following experiments describe production of TIFa in 293-EBNA cells. 1 

Complementary DNA for TIFa was described' supra .. It was subcloned into a 
commercially available expression vector pCEP-4, in operable linkage with a CMV promoter. 
The resulting plasmids were transfected into 293-EBNA cells, using standard lipofectamine 
i methods. Following transfection, the cells were incubated in a methionine, free medium, 
10 supplemented with 35 S labeled methionine, for 24 hours. Supernatant was harvested, and run 

on an acrylamide gel, followed by electrophoresis. The gel was then dried and exposed to 
autoradiography for 1 day. A control was then run by transfecting cells with the same 
plasmid,, in which the cDNA was cloned in the antisense direction. | 

A heterogenous band of about 25-30 kilodaltons was found from the cells transfected 
15 with TIF in the sense direction. Any discrepancies between the predicted molecular weight, 

the actual molecular weight in the system, and the heterogeneity, can be attributed to 
glycosylation. In a series of parallel experiments, cDNA encoding human TIF was expressed 
in the same way as the murine cDNA was expressed. With the exception of the change of the 
cDNA, all experimental parameters were the same. 

20 Example 20 

Further experiments were carried out to study production of TIFa in COS cells. 
Specifically, TIFa cDNA was subcloned into the plasmid pEF-BOS.puro described by 
Demoulin et al. , supra , in operable linkage with the EF-la promoter. The plasmid cDNA was 
transfected into COS cells, using the same lipofectamine method described supra. The cells 

25 were incubated in methionine free medium, supplemented with 35 S methionine for 24 hours, 

after which supernatant was treated as described in example 20, supra . Again, a heterogenous 
band of 25— 30 kilodaltons was observed, as well as an 18 kilodalton band, which probably 
represents a non-glycosylated form of the molecule. 



13 



BNSDOCID: <WO 0024758A1_I_> 



WO 00/24758 , , "'. PCI7US99/24424 

I , ' 

Example 21 

In these experiments, it was discovered that TIF induces STAT activation in mesangial, 
neuronal melanoma, and hepatoma cells'^ It is kndwri that when cytokines activate, STAT 
factors, the factors dimerize, move from cytoplasm to the nucleus; and bind to target 
5 sequences in promoters. The details of the experiments follow. 

Transfected 293 r EBNA cells as described supra were used following incubation in 
normal medium for 48 hours, as were supernatant from the controls, also described supra . 
Samples of a mouse kidney mesangial cell line, ("MES13" hereafter), a ratpheochroniocytoma 
cell line, ("PC12" hereafter), four different human melanomas (SK23, AUMA^ NA-8mel and 

10 MULL), human heptaoma (HepG3) and rat hepatoma (H-4-II-K) were used.* Cell samples' 

(0.5xl0 6 ) were stimulated for 5-10 minutes in the presence of 1 % of , supernatant Nuclear 
extracts were then prepared, in accordance with Demoulin et al. . Mol. Cell. Biol. 16: 4710 
( 1 996) , incorporated by reference . In brief, cells were washed with PBS and then resuspended 
in 1 ml of ice cold hypotonic buffer for 15 minutes. (Buffer was lOmM HEPES buffer, pH 

15 7.5, with lOmM KC1, lmM MgCl 2 , 5% glycerol, 0 5 mM EDTA, O.lmM EGTA, 0.5mM 

dithiothreitol, and lmM Pefabloc, lmM Na 3 V 4 , and 5mM NaF). Cells were then lysed by 
adding 65 /d of NP-40, followed by vortexing. Nuclei were pelleted, by vortexing for 30 
seconds at 14,000 rpm, followed by extraction in buffer supplemented with HEPES (20mM), 
glycerol (20%), and NaCl (420mM). Nuclear debris was removed by centrifuging for 2 

20 minutes. DNA binding activity was determined in accordance with Demoulin et aL . supra . 

using a 32 P labeled double stranded oligonucleotide called "GRR," which contains the STAT 

binding site of the FcyRI gene promoter, i.e.: 

5 1 ATGTATTTCC CAGAAA-3' (SEQ ID NO: 13) 

and 

25 5 ' -CCTTTTCTGG GAAATAC-3 1 (SEQ ID NO: 14) 

corresponding to the upper and lower strands of the binding sites in the GRR probe. Briefly, 
5/x\ volume of nuclear extracts were incubated in binding buffer (12mM HEPES, pH 7.6, 
lOmM KC1, 0.5mM EDTA, 2.5% glycerol, O.lmg of poly(dl-dC) per ml) for 5 minutes. 
Radiolabeled GRR probe (10 5 cpm; approximately 0.5ng) was added, and incubation was 

30 continued for 25 minutes before loading onto a non-denaturing polyacrylamide gel. 
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It was also noted that the complexes observed in MES13 cells, described supra , were 

partially overshifted by both anti-STAT5 and anti-STAT3 antibodies, showing that (i) the cells 

under examination were targets for TjlF, and' (ii) that STAT3 and STATS are major 

1 1 1 ' i 1 • j . i 

components of the complex activated by TIF. The difference in' STAT profile, as compared 

5 to the profile in Example 12, supra , is attributable, to the difference in cell source (human 

versus mouse). It was also observed that human TIF works on' murine cells, and vice versa. 

■. i • ■ 

Example 22 1 ' ' 

i , » 

This example details the isolation and cloning of a nucleic acid molecule which encodes 

• , .i 

human TIF. First, human peripheral blood mononuclear cells were prepared via standard 
10 density gradient centrifugation. Following this preparation, samples were cultured for 24 

hours, at 3 x 10 6 cells/ml, either with or without anti-CD3 monoclonal antibody (The antibody 
was the commercially available OKT3 niAb, usee! in the form of ascites fluid at 1/500 

dilution). This antibody was used because T cell derived cytokines are generally expressed 

i 

only upon activation by e . g . , CD3 specific antibodies . 
15 Total RNA was isolated from these cells, using standard guanidine-isothiocyanate / 

CsCl ultra-centrifugation techniques. Following isolation, 10//g samples of the RNA were 

reverse transcribed using an oligo (dT)15 primer. 

Following preparation of cDNA, as outlined supra , samples which corresponded to 

lOOng of total RNA were amplified, via PCR, using the following primers: 
20 5" - AGCTGCTCAA CTTCACCCTG GA - 3 1 (SEQ ID NO: 15) 

5* - CCACTCTCTC CAAGCTTTTT CA - 3' (SEQ ID NO: 16) 

which are based upon a murine cDNA sequence, (i.e., SEQ ID NO: 7). The PCR conditions 
involved 30 cycles of amplification, with one cycle defined as 1 minute at 94° C, followed by 
1 minute at 42 °C, and then 2 minutes at 72°C. Amplification product was separated on an 
25 agarose gel, using standard methods, and then sequenced. The result indicated that fragments 

of the cDNA had been amplified. Hence, a second reaction was carried but, using the same 
materials except SEQ ID NO: 16 was replaced by SEQ ID NO: 17, i.e.: 
5' - CAAGTCTACC TCTGGTCTCA T - 3* 

This second PCR reaction was carried out for 25 cycles, with one cycle being defined as 1 
30 minute at 94°C, followed by 1 minute at 45°C, and then 2 minutes at 72°C. The 

15 
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amplification product was subjected to the same steps as the f first one. Again, fragments of 
cDN A were amplified. , , / 1 

Example 23 , , . ' 1 . i ■ 1 , ; ; f - m ., , ; 

Following preparation of amplification product, the 5,' end of cDNA was isolated by 
5 using standard, 5* - RACE techniques. In brief, first strand cDNA was prepared, by using 

SEQID NO: 18 as a primer, i.e. : [ i Vj ; 1 1 , ( 

5\ - TGGCCAGGAA GGGCACCACC t - V V ' 

This primer was based upon the sequence information obtained in accordance >vith example 
22. In brief, the 5' - RACE method was carried out by combining l' jig of total. RNA, 

10 prepared as described supra , 2.5 pmoles of SEQ ID NO: 18, reverse transcriptase , reverse 

transcriptase buffer, 2.5 fA of dNTP mix (10 mM), 2.5 fA of MgCl 2 (25mM), and 2.5 \j\ of 
dithiothreitol (0. 1 M). The reaction was carried out and, after completion, original RNA was 
removed via adding RnaseH, and Rnase TI. Any unincorporated dNTPs, as well as primer 
and proteins, were removed. The cDNA was tailed using terminal transferase, or "TdT." 

15 This enzyme creates a 3' -binding site for the abridged anchor primer, as described infta. 

Tailing was carried out by combining the purified, first strand cDNA, TdT, buffer (10 mM 
Tris-HCl, 25 mM KC1, 1.5 mM MgCl 2 ), and 200 >uM of dCTP. 

Following the tailing reaction, PCR was carried out using 
5* - TGGCCAGGAA GGGCACCACC T - 3 1 (SEQ ID NO: 19), 

20 and 5* - RACE abridged anchor primer: 

5 ! - GGCCACGCGT CGACTAGTAC GGGIIGGGIIGGGIIG - 3' (SEQ ID NO: 20). 
The amplification involved 35 cycles (1 cycle defined as 1 minute at 94°C, 1 minute at 56°C, 
and 2 minutes at 72 °C). Following this, nested amplification was performed on 5 /A of a 
1/100 dilution of the amplification product, using SEQ ID NO: 19 and the abridged universal 

25 amplification primer: 

5' - GGCCACGCGT CGACTAGTAC - 3 f (SEQ ID NO: 21). 

Amplification involved 30 cycles (1 cycle being defined as 1 minute at 94 °C, 1 minute at 
56°C, and 2 minutes at 72 °C). The resulting PCR product was cloned, following standard 
procedures, and sequenced. 
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These three protocols, i.e., the two experiments , described supra which generated 
fragments, and the 5' - RACE PCR, also described supra , permitted alignment of the 

I, r 1 * t j I ' ( 

sequenced amplification product, to generate the complete sequence. 

^ Following the alignment, oligonucleotides 'were generated which flanked the deduced 
5 open reading frame, i.e.: . M ( ' , - 

' ' 1 ' ' ' ; .... ' '. , < ;!>' ' , ' ' ' 

5' - CCTTCCCCAG TCACCAGTTG - 3[ (SEQ ID NO: 22) , ( ' 1 / , 

and i i 

5* - TAATTGTTAT TCTTAGCAGG - 3* (SEQ ID NO: 23), t 

These primers were used to amplify the entire open reading frame, using "mRNA from CD3 
10 specific mAb stimulated cells, as described supra . For amplification, 25 cycles (1 cycle being 

defined as 1 minute at 94°C, 1 minute at 56°C, and 2 minutes at 72°C). 

t > The, complete sequence of the human cDNA is set forth at SEQ ID NO: 24. 

As with the murine sequence, there are potential start codons at positions of SEQ ID 

NO: 24 which correspond to amino acids 1 and 13, as well as codons corresponding to 
15 methionine at amino acid positions 58, 85, and 92. The possible initiator codons correspond 

to proteins with calculated molecular weight of 19,998 daltons, and 18,735 daltons 

respectively (for 176 or 167 amino acids, respectively). As with the murine form of the 

protein, hydrophobic leader sequences are seen, indicating an N-terminal signal sequence of 

from about 20 to about 40 amino acids. 



20 Example 24 

These experiments detail work on the isolation of human genomic DNA corresponding 
to the cDNA discussed supra . 

Based upon the cDNA sequences, primers were developed which correspond to 
nucleotides 51-70 and the complement of nucleotides 631-650 of SEQ ID NO: 24. PCR was 

25 carried out, using standard methodologies. Specifically , lOOng of genomic DNA was used as 

a template, and 33 cycles of amplification were cararied out (one cycle of amplification being 
defined as 94° C for 30 seconds, 50° C for 30 seconds, and 72° C for 5 minutes). Once a 
sequence was isolated, it was sequenced, and this is set forth as SEQ ID NO: 25. The 
sequence is about 4.8 kilobases in length, and is believed to contain the entire genomic 
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sequence encoding the TIF molecule, lacking only the 5' flanking region, the promoter, and 




Example 25 1 ' d;. i '/-',< 1 1 ; ' 1 ' ' ' ■ 

It was of interest to identify where the genomic DNA discussed supra was located in 
the human genome. In order to do this, two different approaches were taken 1 . In^the first, the 
sequence discussed supra , i.e., SEQ ID NO: 25; was labeled with a flourescerit label, and then 
was used to probe the human genome via fluorescent, 'in situ hybridization ("FISH") using 
standard methods. 1 i , 1 , 

In a second approach, a panel of radioactive hybrid clones w f ere screened using the 
probe consisting 1 of nucleotides 51-70 of SEQ ID , NO: 24! and 1 S'-AJCAGATGGA 
TTACTGA ATG-3 1 (SEQ ID NO:26). PCR was carried out using 25 ng of genomic DNA as 
a template, for 35' cycles, where one cycle is defined as 94 °C for in minute, 55° C for i 
minute and 72 °C for 2 minutes. ' 

Both methodologies indicated that the gene is located at chromosome 12ql5. Some 
work links diseases associated with asthma at this site. See, e.g. Nat. Genet. 15:389-392 
(1997); Ober, et al, Hum. Mol Genet. 7(9): 1393-1398(1998); Nickel, et al, Genomic 
46(1): 159-162(1997); Takahashi, etal, Genomics 44(1): 150-2(1997); Barnes, etal, Genomics 
37(1):41-50(1996), all incorporated by reference.. 

Example 26 

These experiments describe the manufacture of antibodies which bind to the TIF 
protein. To make these, a peptide consisting of amino acids 40-61 encoded by SEQ ID NO: 
7 was coupled to KLH carrier protein, using standard methods and a ratio of 1 mg peptide to 
1 mg carrier protein. Subject animals (rabbits), were immunized 3 times, at 2 week intervals, 
with 150 fig of the complex. The immunogen was emulsified in Complete Freund's Adjuvant 
for the first injection, and then Incomplete Freund's Adjuvant for the next two. 

A first bleed was performed one month after the last injection, and serum was 
prepared, following known methods. 

The serum was then tested in a standard Western Blot. In brief, 10 pi of supernatant 
from cells transfected with either SEQ ID NO: 7 or SEQ ID NO: 24 were separated via SDS- 



18 



WO 00/24758 1 PCT/US99/24424 

i 

PAGE electrophoresis, and then blotted onto PVDF membranes. Antiserum was diluted to 
1:500, and used in a standard Western Blot protocol, together with anti-rabbit antibody as the 
secondary antibody, and a commercially available detection* kit. • 
It was found that the serum did, in fact, recognize the TIF protein. 
5 In figure 1 , the deduced amino acid sequences of murine and human TIF are set out. 

The high degree of homology is seen in the boxed regions. , ( 

The foregoing examples describe the invention, one aspect of which' are isolated 
nucleic acid molecules, which encode TIF proteins such as those with the amino, acid sequence 
of the protein encoded by the nucleotide sequence of SEQ ID NO: 7, 24 or 25. It will be 

10 appreciated by one of ordinary skill that the degeneracy of the genetic code facilitates the 

preparation of nucleic acid molecules which may not be identical to the nucleotide sequence 
of SEQ ID NO: 7, 24 or 25, but which encode the same protein. Of course 1 , SEQ ID NOS: 
7, 24 and 25 are preferred embodiments of this invention, but other embodiments are also a 
part of the invention. Genomic DNA, complementary DNA, and RNA, such as messenger 

15 RNA, are all to be included therein. Isolated nucleic acid molecules from other animal 

species, including other mammals, are also a part of the invention. A preferred aspect of the 
invention are isolated nucleic acid molecules whose complements hybridize to SEQ ID NO: 
7, SEQ ID NO: 8, SEQ ID NO: 9, or SEQ ID NO: 24 under stringent conditions. "Stringent 
conditions," as used herein, refer, for example, to hybridization at 65°C in buffer (3.5xSSC), 

20 0.02% Ficoll, 0.02% polyvinylpyrrolidone, 0.02% bovine serum albumin, 25mM NaH 2 P0 4 

(pH 7), 0.1% SDS, 2mM EDTA, followed by a final wash at 2xSSC, room temperature and 
then 0.1xSSC/0.2xSDS at temperatures as high as, e.g., about 65°C. More stringent 
conditions, such as 0. lxSSC, can also be used. These nucleic acid molecules encode proteins 
of about 17-22 kD as determined by SDS - PAGE, which activates STAT proteins, such as 

25 STAT 1,STAT3 and/or STATS. In glycosylated form, these proteins can range from about 

17 to about 30 kilodaltons, as determined by SDS - PAGE. 

Also a part of the invention are expression vectors which include the nucleic acid 
molecules of the invention, operably linked to a promoter, so as to facilitate expression of the 
DNA. It is well within the skill of the artisan to prepare such vectors. 

30 The vectors, as well as the nucleic acid molecules per se, can be used to prepare 

recombinant cells, be these eukaryotic or prokaryotic, wherein either an expression vector or 
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the nucleic acid molecule itself is incorporated therein. E. coli cells, COS cells, CHO cells, 
etc. , are all examples of types of cells which may be used in accordance with this aspect of the 
. invention. ■ , - ' V.' | i 11 l ■ 

>-m Proteins encoded by the above referenced nucleic acid molecules, preferably in isolated 
5 form, are another feature of this invention. By "protein" is meant both the immediate product 

of expression of the nucleic acid molecules, glycosylated forms 1 of it, as well as multimeric 
forms, such as dimers, trimers, and so forth. Also a part,of the invention are ( multimers, such 
as dimers, which contain at least one protein molecule ,of the invention, and at least one, 
different protein molecule. Preferably, this different protein molecule is a cytokine, such as 
10 ' IL-1(D. Also included as a feature of the inventions are constructs, such as fusion proteins, 
where all or a part of the proteins described 1 supra are linked in some fashion, such' as in a 
fusion protein, to at least one addtional protein or peptide, or amino acid sequence, The 

"fusion partner" may be, for example, a molecule which provides a recognizable signal^ either 

i' t i , ' 

directly or indirectly, such as a, FLAG peptide, p-galactosidase, luciferase, and so forth. 

15 These fusion partners are preferably joined to the molecule which is described supra at the N- 

and/or C- terminus of the protein; however, it is to be understood that there are many 
techniques known for joining molecules to amino acids, and any and all of these methodologies 
can produce constructs which are a part of the invention. 

The individual protein molecules of the invention, as noted supra , will preferably have 

20 a molecular weight of from about 17 to about 30 kilodaltons, as determined by SDS-PAGE. 

In multimeric forms, the molecular weight of the complex will, of course, vary, but the TIF 
molecules contained therein will each have a molecular weight of about 17 to 30 kilodaltons, 
as determined by SDS-PAGE. 

The proteins preferably consist of at least about 120 and no more than about 200 amino 

25 acids. Preferably, the amino acids sequences consists of or comprises all or part of the amino 

acid sequences encoded by SEQ ID NOS: 7,8, 9, 24 or 25. More preferably, the amino acid 
sequence contains all but about the first 40 amino acids encoded by said SEQ ID's. Even 
more preferably, it contains all but about the first 20 amino acids encoded by these sequences. 
Most preferably, the protein comprises amino acids set forth at SEQ ID NO: 27 or 28. 

30 It will be appreciated by the skilled artisan that the proteins encoded by the above 

recited nucleic acid molecules are a feature of the invention, and may be used to produce 
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antibodies, in accordance with standard protocols. Such antibodies, in monoclonal and 
polyclonal form, constitute a further feature of the invention as do fragments of said 
antibodies, chimeric forms, humanized forms, recombinant forms, and so forth. 1 'Also a 
feature of the invention are immunogens, comprising all or a part of the amino acid sequence 
5 protein molecules of the invention, preferably combined with an adjuvant, such as Complete 

or Incomplete Freund's Adjuvant. Portions of the protein sequences may be linked to other 
molecules, such as keyhole limpet hemocyanin, to 'render them more immunogenic. These 
antibodies can be used, e.g., to determine if the proteins of the invention are present. ' This 
is a further feature of the invention, as is now explained 1 . It has been shov^niin the examples, 

10 that the nucleic acid molecules of the invention were expressed in the presence of the IL-9. 

Hence, a further feature of the invention is a method to determine if IL-9 is or has been 
present, wherein one detects either the proteins of the invention, using antibodies for example, 
or mRNA using the nucleic acid molecules of the invention, as probes. The mRNA can be 
determined directly, or in the form of cDNA. Such probes may or may not be labeled, as a 

15 matter of choice for the user. Hence, one can determine, for example, if, following 

administration of IL-9, the cytokine is still efficacious, by determining if the nucleic acid 
molecule of the invention is present. This type of assay can be adapted, for quantitative 
studies, wherein one determines, for example, either if a cell is sensitive to IL-9, and if so, 
how sensitive it is. One can also use the proteins of the invention to phosphorylate STAT 

20 proteins such as STAT1 , STAT3 and/or STAT 5. This in turn results in dimerizatiori of the 

STAT protein, followed by migration to the nucleus to provoke the effect that these STAT 
proteins have on cells. 

One could also use these molecules to test the efficacy of IL-9 agonists or antagonists 
when administered to a subject, such as a subject suffering from lymphoma , an immune 

25 system disorder such as an allergy, acquired immune deficiency syndrome, autoimmune 

diabetes, thyroiditis, or any of the other conditions described in , e.g, US Patent No. 
5,830,454; 5,824,551, and pending application Serial No. 08/925,348, filed on September 8, 
1997 now allowed, all of which are incorporated by reference. The molecules can also be 
used to mediate the role of IL-9 in these and other conditions. To elaborate, since IL-9 

30 induces TIFs, the TIFs are useful as IL-9 activity mediators. Thus, a further aspect of the 

invention is a method to determine activity of endogenous IL-9, such as in situations where 
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excess IL-9 activity is implicated; such as asthmas, allergies, and lymphomas. One can also 

block or inhibit IL-9 activity by blocking or inhibiting TIF or TIF activity, using, e.g., 

antisense molecules, antibodies which bind to TIF, or other antagonists ofthese molecules 

For example, m uteins of; TIF, which bind to the 1 , TIF receptor but do not activate it, therby 

5 inhibiting IL-9 induced activity, are a feature of the invention. Examples of conditions which 

can be treated by the use of such TIF muteins are'allergies; asthma, and so forth:* Muteins in 

accordance with the invention can be made in accordance with,, e.g., Weigel et al Eur J 

Biochem 1 80(2) :295^300( 1989) and Epps, et al, Cytokine 9(3): 149-156(1997), both of wliich 

1 i i 

are incorporated by reference. Such muteins can be used in the treatment of asthma, allergies, 

10 or both. Further, it will be clear to the skilled artisan that the models set forth, supra , can also 

be used to screen for appropriate muteins/ The ability to regulate IL-9 activity is important 
in conditions such as those listed supra , as well as conditions such as apoptosis, including 
Cortisol induced apoptosis, conditions involving the nuclear expression of BCL-3, since IL-9 
is known to induce such expression, and so forth. "Antibodies, " as used herein, refers to any 

15 portion of an antibody which binds to TIF, including chimeric and humanized antibodies. 

Another feature of the invention relates to the ability of the TIF type molecules of the 
invention to either promote regeneration or inhibit differentiation of tissue types on which the 
molecules are active. As was shown, supra , the TIF molecules target various cancer and 
normal cell lines (i.e. , mesangial and neuronal cells, as well as melanoma and hepatoma cells). 

20 Hence, one can stimulate regeneration of tissue via, e.g., adding an amount of a TIF type 

molecule to a sample in need of regeneration of a tissue acted on by the TIF molecule. This 
approach can be used both in vitro, and in vivo. Similarly, antagonists of TIF may be added 
when the situation is one where the aim is to inhibit differentiation of a particular type of 
tissue, such as melanoma or hepatoma. 

25 The genes which encode TIF, as noted in Example 25, supra , are located on 

chromosome 12. This chromosome is associated with asthma, as is known in the art. Hence, 
a further embodiment of the invention is a method for determining susceptibility to conditions 
such as, or related to asthma, by determining if aberrations, such as polymorphisms, deletions, 
additions, etc. , are present at the site of the TIF gene. Such aberrations may be an indicia of 

30 susceptibility to, or of the presence of, asthma, an allergic condition, or one or more related 

conditions. The ability to detect aberrations in a DNA sequence is well known in the art, and 
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such methods need not be set forth herein. Preferably, the aberration or aberrations is detected 
via standard techniques, such as PCR, using the methodologies and primers referred to supra . 

i 1 , ( i ■ ! | 

Other features, of the invention will be clear to the artisan and need not be discussed 
further. ' ' 

The terms and expressions which have been employed are used as terms of description 
and not of limitation, and there is no intention in the use of such terms and expressions of 
excluding any equivalents of the features shown and described or portions thereof, it being 
recognized that various modifications are possible within the scope of the invention. 
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Claims i i 1 

1 1 , t ■ . ■ " 1 , ■ 

■■ ..: i. " 1 1 ' . ■ ' ' ' • ' 

I ■ ■ : ' • ■ 1 

1. An isolated nucleic acid molecule which 4 encodes alt \ cell derived inducible factor the 
, M complementary sequence of which hybridizes', under stringent 1 conditions » to at least 

one of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, SEQ ID 
NO: 24 or SEQ ID NO:25. ( | , - lt 

2. ^ The isolated nucleic acid molecule of claim 1, wherein said, isolated nucleic acid 
1 molecule encodes a protein having the amino'acid sequence of the protein encoded by 

- SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID,NO: 9, SEQ IE) NO: 29, SEQ ID NO: 24 or 
SEQ ID NO: 25. ' 

3. The isolated nucleic acid molecule of claim 1, wherein said molecule is cDNA. , 

V i . 1 

4. The isolated nucleic acid molecule of claim 1 , wherein said molecule is 1 genomic DNA. 

i 

5. } The isolated nucleic acid molecule of claim 2, the nucleotide sequence of which 

consists of the nucleotide sequence SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, 
SEQ ID NO: 29, SEQ ID NO: 24 or SEQ ID NO:25. 

6. The isolated nucleic acid molecule of claim 4, having the nucleotide sequence of SEQ 
ID NO: 25. 

7. An isolated nucleic acid molecule which encodes the protein encoded by the isolated 
nucleic acid molecule of claim 1. 

8. Expression vector comprising the isolated nucleic acid molecule of claim 1, operably 
linked to a promoter. 
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.' . . .' ' ■•• ',' ' 

Expression vector comprising the isolated nucleic, acid molecule of claim 2, operably 
linked to a promoter. 



t 



i(f. Expression vector comprising the isolated nucleic abid molecule of claim 3,' operably 



linked to a promoter. 



11. Expression vector comprising th'e j isolated nucleic acid molecule of claim 4, operably , 



linked to a promoter. j , 



1 i 1 / 

L 



12. ' Expression vector comprising the isolated nucleic acid molecule of claim 5, operably 
linked to a promoter. ( ' \ ' < 

i 

13 ( . Expression vector comprising the isolated nucleic acid molecule of claim 6, operably 
linked to a promoter. , ' ' t 

14. Recombinant cell comprising the isolated nucleic acid molecule of claim 1. 

15. Recombinant cell comprising the isolated nucleic acid molecule of claim 2. 

16. Recombinant cell comprising the expression vector of claim 8. 

17. Recombinant cell comprising the expression vector of claim 9. 

18. Recombinant cell comprising the expression vector of claim 10. 

19. Recombinant cell comprising the expression vector of claim 1 1 . 

20. Isolated protein encoded by the isolated nucleic acid molecule of claim 1, and having 
a molecular weight of about 17-30 kilodaltons as determined by SDS - PAGE. 
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21 . The isolated protein of claim 20< comprising at, least 120 amino acids of the protein 
encoded by SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID r NO:29, SEQ ID 
NO:'24,orSE^iDN6: 25.".'' ?., '" ': '' ' "/" ".U '"„' ' ' 1 ' '' " " „ 

' ■ 1 '■' ■ V- 1 ' /, ' , . ■ 

22. The isolated protein of claim 21, comprising at least all but the 40 N terminal amino 
acids encoded by SEQ ID NO:, 7, SEQ; ID NO: 8,, SEQ ID NO: 9, SEQ ID NO:29, 

SEQ ID NO: 24, of SEQ'ID NO: 25. ' ,i "■"[•< 

' 1 1 , ,"• "■ - ■■ 

i I ■ i 1 

i , ■ J i- 

23. The isolated protein of, claim 22, comprising at'leas't all but the 20 N terminal amino 
acids encoded by SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29^ 
SEQ ID NO: 24,' or SEQ ID NO: 25. '■ , ' 

11 .1 

24. Antibody which specifically binds to the isolated protein of claim 20. 

25. The antibody of claim 24, wherein said antibody is a monoclonal antibody. 

26. A method for determining effectiveness of interleukin L 9 on a cell, comprising 
contacting said cell with an agent specific for at least one of (i) an isolated nucleic acid 
molecule which encodes a protein whose amino acid sequence is identical to the amino 
acid sequence encoded by the nucleotide sequence of SEQ ID NO: 7, SEQ ID NO: 8, 
SEQ ID NO: 9, SEQ ID NO:29, SEQ ID NO: 24 or SEQ ID NO: 25 and (ii) a protein 
whose amino acid sequence is identical to the amino acid sequence encoded by the 
nucleotide sequence of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID 
NO:29,SEQ ID NO: 24 or SEQ ID NO: 25 and determining interaction of said agent 
with (i) or (ii) as a determination of effectiveness of interleukin-9 on said cell. 



27. 



The method of claim 26, wherein said agent is an antibody which specifically binds to 
(ii). 
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1 I 
t I 

' , , j ' •■' 

28. The method of claim 26, wherein said agent comprises the isolated nucleic acid 
molecule of SEQ ID NO: 7, SEQ ID NOi8, SEQ ID NO:9, SEQ ID NO:' 29, SEQ ID 

NO:24, or SEQ ID NO: 25." ' ■', , ' }\ * * ' ' 1 ' , . * ' ' 

29. A method for stimulating activation of a STAT protein comprising administering an 
i i amount of the protein of claim 20 sufficient io stimulate activation of said 1 STAT 

protein. i , t , , , i , 

i i 1 , .' 1 , ' . ■ 

30. The method of claim 29, wherein said STAT protein is STATl , ,STAT3 or STAT 5. 

. •. i ■ i ■ ," • 1 i > 1 , • 

1 1 

31. A method of inhibiting activation of a STAT protein, comprising administering an 
amount of an antagonist of the protein of claim 20 sufficient to inhibit stimulation of 
expression of said STAT protein by said protein. 

32. The method of claim 31, wherein said STAT protein is STATl , STAT3 or STAT 5. 

33 1 A method for determining presence of TIF in a sample, comprising contacting said 
sample with an agent which binds to TIF or a nucleic acid molecule encoding TIF, and 
determining said binding as a determination of TIF in said sample. 

34. The method of claim 33, wherein said agent is an antibody. 

35. The method of claim 33, wherein said agent is a nucleic acid molecule. 

36. A method for screening to determine if a substance influences IL-9 activity, comprising 
adding said substance to a sample of TIF producing cells, in the presence of IL-9, and 
determining production of TIF, wherein a difference in production of TIF by said cells 
as compared to production of TIF by said cells in presence of IL-9 but not said 
substance indicates said substance influences IL-9 activity. 



WO 00/24758 | PCT/US99/24424 

37. The method of claim 36, wherein said substance is an IL-9 inhibitor or antagonist, said 
method further comprising determining lower levels pf TIF production by said cells in 
the presence of said substance as. compared to, its absence. i > 

1 1 1 i ' ' , \" ' 1 /i 1 ■ ' 

38. The method of claim 36, wherein said, substance is an IL-9 activator, said method 

i " ■ ,' ■ ■ 

further comprising determining, higher levels of TIF production by' said cells in the 

presence of said substance as compared to its absence. J ' 1 (! 

'* t ■ , - i 

39. A method for determining an aberrant level of IL-9 activity in a subject, comprising 

determining level of TIF in - a subject arid comparing said level to a normal level^ 

differences therebetween being indicative of ah aberrant level of IL-9 in said subject. 

i ■ 

i.i i 
11 .i 

40. The method of claim 39, wherein said aberrant level is excess endogenous IL-9. 

1 . ' 1 1 ' 

i it 1 

41. The method of claim 39, wherein said aberrant level is insufficient endogenous IL-9. 

42. The method of claim 40, wherein said subject suffers from asthma, an allergy, or 
lymphoma. ' 

43. A method for inhibiting IL-9 induced activity in a subject in need thereof, comprising 
administering an amount of a TIF inhibitor sufficient to inhibit IL-9 induced activity. 

44. The method of claim 43, wherein said TIF inhibitor is an antisense molecule. 

45. The method of claim 43, wherein said inhibitor is an antibody. 

46. A method for treating a subject suffering from asthma or an allergy, comprising 
administering to said subject an amount of a TIF mutein sufficient to alleviate said 
asthma or allergy. 

47. A method for determining if a mutein of TIF is therapeutically useful, comprising 
contacting a cell which produces IL-9 with said mutein, and determining effect of said 
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mutein on prouction of IL-9, reduction thereof being indicative of possible thereapeutic 
efficacy for said mutein. '.'>,' 1 ' . • ' •■ 

' i|i I t lil' i J M . . ' I » - , i • ' 'I i! 

'it 1 ' ,1 

,( • r . * ■ ■ „ . ,. 

48. A method for determining susceptibility 1 to a condition 1 characterized by aberrant 
expression of TIF, comprising determining nucleotide sequence of a TIF gene of a 
subject believed to possess an aberrant TIF gene,, presence of an aberrant TIF gene 

being indicative of possible susceptibility to asthma or allergy. , 

1 j 1 ' ' ' • * .« ' ' 

■ i * ■ . • , 

49. The method of claim 48, comprising contacting a sample taken from said subject with 

•■•I , "' , ■ > 

a pair of oligonucleotide primers which amplify, said TIF gene. , 
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TIFalpha cDNA 



1 » . M 



taaacaggctctcctctcacttatcaactgttgacacttgtgcgatctctgATGGCTGTCCTGCAdAAA 

ttcccttatggggactttggccgccagctgcctgcttrgtcattgccctgtgggcccaggaggcam 
tgcgctgcccgtca 

acacccggtgcaagcttgaggtgtccaacttccagcagccgtac'atcgtcaaccgcacctttatsct 
ggccaaggaggcc 1 • ' ' , \ ' ' , ' 

agcgttgcagataacaacacagacgtccggctcatcggggagaaactgttccgaggagtcagtgct 
aaxgatcagtgcta [ ' . , 

cctgatgaagcaggtgctcaacttcaccctggaagacgttctgctcccccagtcagacaggttcca 
gccctacatgcagg ' 

aggtggtacctttcctgaccaaactcagcaatcagctcagctcctgtcacatcagcggtgacgacca 
gaacatccagaag * , ' ' , 

Aatgtcagaaggctgaaggagacagtgaaaaagcttgga^ 

ggaa!ctggacctgct ' ' , , p 1 

GTTTATGTCTCTGAGAAATGCTTGCGTCTGAgcgagaagaag • 

agaacaataagatccctgaatggacttttttactaaaggaaagtgagaagctaacgtccatcatcattagaagatttcac 4 ' . 

atgaaacctggctcagttgaaaaagaaaatagtgtcaagttgtccatgagaccagaggtagacttgataaccacaaagat 

tcattgacaatattttattgtcactgatgatacaacagaaaaataatgtactttaaaaaattgmgaaaggaggttacc 

tctcattcctUagaaaaaaagcttatgtaacttcatttccatatccaatattttatatatgtaagtttatttattataa • ( 

gtatacattttatttatgtcagtttaUaatatggatttatttatagaaacattatctgctattgatatttagtataagg 

caaataatatttatgacaataactatggaaacaagatatcttaggctttaataaacacatggatatcataaaaaaaaaa 



TIFalpha Protein 



MAVLQKSMSFSLMGTLAASCLLLIALWAQEANALPVNTRCKLEVSNFQQPYIVNRTFMLAKEASLADNNT 
DVRL1GEKLF 

RGVSAKDQCYLMKQVLNFTLEDVLLPQSDRFQPYMQEWPFLTKLSNQLSSCHISGDDQNIQKNVRRLK 

ETVKKLGESGE 

IKAIGELDLLFMSLRNACV 



TIFbeta cDNA 



id AJO:S ) 



aacagoctctcctctcagttatcaacttttgacacttgtgcgatcggtgATGGCTGTCCTGCAGAAATCTATGAG Mill 

CCCTTATGGGGACTTTGGCCGCCAGCTGCCTGCTTCTCATTGCCCTGTGGGCCCAGGAGGCAAATG 

CGCTGCCCATCAAC 

ACCCGGTGCAAGCTTGAGGTGTCCAACTTCCAGCAGCCGTACATCGTCAACCGCACCTTTATGCTG 
GCCAAGGAGGCCAG 

CCTTGCAGATAACAACACAGACGTCCGGCTCATCGGGGAGAAACTGTTCCGAGGAGTCAGTGCTAA 
GGATCAGTGCTACC 

TGATGAAGCAGGTGCTCAACTTCACCCTGGAAGACATTCTGCTCCCCCAGTCAGACAGGTTCCGGC 
CCTACATGCAGGAG 

GTGGTGCCTTTCCTGACCAAACTCAGCAATCAGCTCAGCTCCTGTCACATCAGTGGTGACGACCAGA 
ACATCCAGAAGAA ^ 

TGTCAGAAGGCTGAAGGAGACAGTGAAAAAGCTTGGAGAGAGCGGAGAGATCAAAGCGATCGGGG 
AACTGGACCTGCTGT 

TTATGTCTCTGAGAAATGCTTGCGTCTGAgcgagaagaagctagaaaacgaagaacigctccttcctgccttctaaaaag 
aacaataagatccctgaatggacttttttactaaaggaaagtgagaagctaacgtccaccatcattagaagatttcacat 
gaaacctggctcagttgaaagagaaaatagtgtcaagttgtccatgagaccagaggtagacttgataaccacaaagattc 
aHgacaatattttattgtcattgataatgcaacagaaaaagtatgtactttaaaaaattgtttgaaaggaggttacctc 
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, '• . 1 , i . 1 

tcattcctctagaagaaaagcctatgtaacttcatttccataaccaatactttatatatgtaagtttatttattataagt, ' r ' 1 1 1 
atacattttatttatgtcagtttattaatatggamatttatagaaaaaittatctgatgttgatatttgagtataaagc '' ' < i ,i 

aaataatatttatgataataactatagaaacaagatatcttaggctttaataaacacatgaatatcataaa , . , 1 1 



TIFbeta Protein fee® rb a) a-, n) 




TIFalpha Genomic Sequence (Sen* /r> MO-, qj , 

-i 1 ' ' i . 1 ',1, 

GTCTATCACCTGCTTAAGATTCTTCTAATTTATAAAA 

GCACGTATTTA , ' 1 

TAGCATGGTGTTGTGACCATGCAGGTACAGAGTGGAATGGTAAGAGGCGCTATTATCAGCATTAACC 
AACATGTTAATGT 1 . 

TTTCTTCTGGCAAGCAAACTTGAAATCTATGTCTTAAACAATCTTCAAGCCTC 
ACTGGAGTCCG ' , , 

CTGCTGTCCAACAGAGCTCTTGAGCACGCTCTCCTCTGTTTGCAATTTTATGTTCTTTGATCGACTCC 

CCAACCTCTCAC 

CTTCGGCTCCTGATGGCCACCTTTCAACTTTCTGCATTTATGAACTCCATGTTTTAATC'I I 1 I I ATTAA 
AATATTCACAC ' 

AATCAGTGTTTGTGCAAGTCTGTTTCACCCACATGTATGtCTGTGCACCAAGTGCTGCCTGGTGCTT 
GTGGGGGCAAGGA ' 

GCAGGAGAGGGTGCCCTGGCACCGGAGTCACGGATGGTTGTGAGCCACCATGAGGATGCTGGGAG 
TTAGACCCAGGTCCT 

CCAGAAGTGCAGCAAATGCTCTTAACCACACGCAGGCATTTCTCTCTCCAGCCCCAACATGAGTGCT 
TTTAGATTCCACC 

TAGAATAGAGATCTGATGGCTTCACTCACTGCCACCTCCCCTTTGCATCTTTCTGCCAAGGAACACC 
AAAAAGCAAGAAT 

CCCCACACTGCTTTCGCTCCTCAAGTCTGCACCTCTCAACAGGTCAAGATTCTCCAGTGTCCCTCTA 
ACACTTTCCCCAG 

TGTCCCTCTAACACTTTCTCCAGTGTCCCTCTAACACTTTCTCCAGTGTCCCTCTAACACTTTTGATCT 
C AATT AG CTG A 

GGGGAGAAAGATCTCACACAGTGATTTTCATGACTTCGCGTTCTAGTCTAGATGTAGGCATTTGCGT 
GTC AGTCT AG G GT 

AGGCGTCTGCTCCCGCTGCTTAGGAAAGACTTTCCTAGTCTAGTTGTCAGGTGCTATCTGGGATTCA 
GTGTACATACAAT 

GCAAAAAATCCCAGTATTTTGTAAATTCTCTTCTTCAACTATCCATCTATATAGTATGTTA 
TCATTTAAAAA 

TAATATTTTGAGACTTATGCTTGCAQAAGTAAAAT^ 
TTTTAAAAAAA 

TCTATGCTTAAAATGTCTATTAGATTGTTCAC^ 
ATTTCAACCTT 

TGTATTTGCATCTACCATAACAGTCTCTGAACCAGAACATTCTGTGGCAATGGGAGCTGTGAAGAAA 
G CC AAC ATTCTTA 

TTAAAAAAAAAAAACAGCTAGTTATAGTTT^ 
TAAAAATTGA 
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AATAATCTCCAAGTTTTCATTATGGCTTA 

GGTCACTTCTA '\\' ' '\'' " ', V'' ;', ■ ' ' h ' ' ' ' ' " ,. 

AAGAGATAAGAATCTATGAAGTTGGTGGGAAAATGAGTCCGTG 

CGGGAGATCAAAG ; • " i' ( , ' . ' " »', 1 ' 

GCTGCTCTACTCAATCAGAATCTACTACGGCAkAGC^ 

TTTCTGGGATTT 

GTGTGCAAAAGCACCTTGTTGGCCCTCACCGTGACGTTlTAGGGAAGACTTCCCATCTCTCAAGGTG 
GGAAGGCTTGGAG 1 * '■ < ^ - ,/ ' 1 

GTGGTGTCTTGTGGCCTCCTATGGTGGTTAGGTACTTCJCAGAAGACAGGACTGGAAATTAG 
GTCTGATGTCATA ' < , i ' 

TCATTCACAATACCA^AAAAACCCTGGTGTC 

ATC AC AAG C AG AG ' ' ' ' ( 

ACACCTAAACAGGTAAGCACTCAGACCTCTACAGACAATCATCTGCTTGGTACCATGCTACCGGACG 
AACATGCTCCCCT \ 1 » 

GATG I I I 1 I GCCTTTTGCTCTCTCACTAACAGGCTCTfcCTCTCACTTATCAACTGTTGACACTTGT(3C ' 
GATCTCTGATGG ■ \ , , - 

CTGTCCTGCAGAAATCTATGAGTTTTTCCCTTATGGGGAGTTTGGCCGCCAGCTGCCTGCTTCTCATT 
GCCCTGTGGGCC 

CAGGAGGCAAATGCGCTGCCCGTCAACACCCGGTGCAAGCTTGAGGTGTCCAACTTCCAGCAGCC 
GTACATCGTCAACCG 

CACCTTTATGCTGGCCAAGGAGGTACAGCTGCATCTCTTTCTCTCCATACCGCCTTGCCATTTTCTCT 
GAAGCACTTGCA 

AACTCTTTAGGGGCGCTTTATCTCCGCAGGTCTCACTACCTATGTTTTCTGTCTCTTTAGA^ 
TAAGGACTGGGT ( f 

CI I I I I CTATTTCTATTTCAAGGTCTCAGGACCATTTCCTATC1TGGCCTTCAGGACACATATACTGAA 
TTTTATCTACA 

GAGGCGCATTTAGAAAGCCACCCACGACTGCAATACTTTCCATTTCTCTGTGCTCTCTTCTGAACTCA 
TACTCTCTTGGC 

TACTCCTGAGACCCACTGCGGACATACATCTCTACTTACAGGCTTTTCTTCCATCTCCTTGTCACCCA 
GGCACTTAGGGT ! 

TTTCTCTCTTTCAGGCCAGCCTTGCAGATAACAACACAGACGTCCGGCTCATCGGGGAGAAACTGTT 
CCGAGGAGTCAGT 

GTAAGTCCTCACTGTGATGAGCAGGGCTAGCTGCGGGAGCTGGTGGACCCTCTGGGATAGTCTGAC 
GTATGACCCCTGCT 

GCTTCTTGTCTACCTGCAGGCTAAAGATCAGTGCTACCTGATGAAGCAGGTGCTCAACTTCACCCTG 
GAAGACGTTCTGC 

TCCCCCAGTCAGACAGGTTCCAGCCCTACATGCAGGAGGTGGTACCTTTCCTGACCAAACTCAGCA 
ATCAGCTCAGCTCC 

TGTGTAAGTCTGACTCTGGCTACCTATGCTCCTCTCTCTTCCTCTTCTATTCCAGTAAGAACCCGAGG 
TCCTGCCCTCTC 

TCTCTTCACAAGAGTGAGGAGGGCCTCAGCACCACCACCATCATAGGCCACTTGAAATAGGTCACAA 

AGG CTTTG G CTTC 

AATTGAGTAATACTTTGAGTTTGTATGAGTGM 
AAATTCTGTAG 

GATGAGAAAGATGTTGGGAACGAAAAAAGGCCTAGATAGAGAAACAGATCTGCTGAGTATAGTACTT 
ATGGGGGGAGCAG 

GGGGCGATATCCACTGAGTACAAGJACTTGTGGGGAGAGAAATCCACTGAGTACAAGTACTTGTTG 
GCATGGAGATCCAC 

TGAGTACAAGTACTTGTGGGGGGAGGGAATGGCACAGAGCAAAAGTTGAAGGGAAGGAAGATGGA 
GAGGCCTCATGGTTG 

GGGGTGTGAAAGGTCACTCCTT1TCCATGTGATGGAGAGTTAAGAAAAACCAGTGTGTGAGTTTGAT 
GTCTTCAGACACC 

CCCAACTATGAAACATATCCACGAGGAGCGGGCAGACTGTGGGAGACCTGGCATTTAGGGAAGGC 
G CG G CTTTTC ACACG 
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.AQAAACTX^GCTC^TCTCTTG^^ 
G^AGTCra^ 

' SSmf MTACTMG ^^ 

tg^La^gc? 
cctag\g A Jcc^ 
. ca^cI^g^ 

actagaaagctgggcacaaacttactagagatgatttttgagctcattaaacggatgctctgaaatgt 

GGCAAAATCAAC i 

ccaga^aac^caaaagagctggatttgcaaatagg 

J^^j^G^CCTATATATATATTTAAGAtrAAA 

agagtaaaaatatcagtcatggattaa™ 

C I I I I I ACCTTC 

A J|TCTTAGI N I I I I I I ICTTCACACCCTGATCAAGCCACTAGTAAGCACCTATCTGCTfiTriArsr'.TAT . 

ACAGCAAACAACATTGCTGTGTGGCCTCTTTGGGGAAGGGAACAGGATAGCAGGAGGCTCAGGPTA 
G C AAGTCTG ACTTG 

ta^tc^a^cTgg GCATGG ^ GATAGCAGAGA ^ 

^ G f!J^ CGG ^^ TGGAA ™TCAGTAAGAW 
CTC I I I I I C I I G 

tgtgtaggctgataaacacacttgttttcttttgagtgttcatggctttgtagatttttagtgctctgc 

CAGTTCTTGTT 

a g a gggttottaccttgacacctgggcttggatgttagcatgccaaa 

tattattcatttactttgtctttggaaaggtgaagcgtgtgtgagaaagaactcacaggagatgtgtt 
ctctgtaggaaa 

ac i i i i i i i i iccccttaaatgcctataatccactttcagtcaactttgacrntataccatgctgtcac 

ATGA^^\GAG"F 

^^^AGGCCCGCTCTCATG G CTCTG G G AAAAG C ACC AATAG G G G AAG G AATGTTATG CTG AG AAAT 

AACTGGTCAGAGCTCCCCCGAAGACCACCACAGGTGTTAAGTAGGAACAGTCCAGGGTGGGCTCAT 
GTAATAG AATG G AA 

CAGAGCGAGGGAAGATAAGCTACAAAGTTTCATAGGGTCCGGAGTCTTAAAGATACAAAATAGCTGC 
TTGGGCTTCATAA 

C AAAGG AAGTCTGGG AAG G C AG C AAGTG AG AG GG AAATG G AAAG G G AAAAAACAG AATGTAG AC5f5 
ACTTGAACAGCTACA 

AATCCTCTACCAGACGATTTTTCTTGGAACAATCTAGAAGGTAGTGGATTAGGTGATTGCAGGGGGA 
CTTGCTTTGCCAT 
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TTGAATCTGGGTTTTTGTCTCTCCATTGAGGT^ 

GAAAGAAGGGGT >\\ ' ' , '\ " *' " ' ' ' ' " '* ,< 

GTTATGACTCCTACCTGGAGTTTTACTAGT^ 

CAAA^AAAATGGA ' !»;, ' V i t , t ■ • ' ', i 

aacctgttgtttgtcttgtttgttcttttgttaAgaaagcacaggcaaXgcccgac 
atgtgggtcttt 

gagtcaaggcttttgagttgagcactcatcaatagttga^catggtcaggtggagggctacctgtca 

GGCCGAGCCCTGC " ■ , ' ■ ' *' ■ 1 • , ' . * ■ , t ■ 

TGGCTTCGCACTTAACATCTCCAGGTCTCAGTATCACTTCCTGCTACTTAGCACAGTTAGGAGTTGA 
GCAAACC I I I I I I . ' ' ' , ' i" 

TCC A4CCCCC ACTAAAATTTAATTG AC AAAAG ACTGTGTAAJTT&TG G G AT ACv\GTGTGATXATTGAT 
CTATGTGTGCAT ' ' ' ' 1 ' 

TGTGCAAGGTTCAATAAGATAGATTAATAGGCCCATCAACAG ( CTTTATGGGTGTGAAATGCAAGTAAT 
ATAGGTAGATGC ' < 1 , 

CTGTGGTGTCCTTAGGTCAGAAAGGCATGATTTTAAGGTCTTGGGdAAATCATATTATACTCATGCTA 

' AAAATAC ATTAT ' i ' \ . t , < 

GTTGATTATTAATCTTTJAGAGAAGG 

TCTTTCTAACTG ' . 

GTACCACTTTAGAAAATGCTACCTGTGCTCAAATTGGTTTGTATTCTTATTTTCATAG 

GTGGAGAGATCA 

AGGCGATTGGGGAACTGGACCTGCTGTTTATGTCTCTGAGAAATGCTTGCGTCTGAGCGAGAAGAA 

G CT AG AAAACG AAG 4 , . ' 

AACTGCTCGTTCCTGCCTTCTAAAAAGAACAATAAGA 

TG AG AAG CT AAC ' i 
GTCCATCATCATTAGAAGATTTCACATGAAACCTGGCTCAGTTGAAAAAGAAAATAGTGtC 
CCATGAGACCAG 

AGGTAGACTTGATAACCACAAAGATTCATTGACAATATTTTATTGTCACTGATGATACAACAGAAAAAT 
AATGTACTTTA 

AAAAATTGTTTGAAAGGAGGTTACCTCTCATTCCTTTAGAAAAAAAGCTTATGTAACTT 
TCCAATATTTT 1 

ATATATGTAAGTTTATTTATTATAAGTATACATTTTATTTATGTCAGTTTAT^ 
q AAAC ATT A 

TCTGCTATTGATATTTAGTATAAGGCAAATAATATTTATGACAATAACTATGGAAACAAGATATCTTAG 
G CTTTAAT AAA 

CACATGGATATCATAAATCTTCTGTCTTGTAATTTTTCTCCCTTTAATATCAACAATACCATCATCATC 
TCATTACCCA 

ATCATTCTCATGATTTCATGCTTGACCCAT ATTAT ACTGTTAAAGTTGGTTCCTGGAGGCCTGTGGTT 
TTGTGTGTGTTG 

TGTGTGTGTGTGGGGTTATGCATGTGAAAGCCAGAGATGGATATTAGGTGTTCTTCTCTATCAGTCTT 

TGCCTTATTATT ^^^^ 

TG AG AC AGG GTCTGTC ACTG AACCTGTAGCTAG G CTG G CC AAC AAG CTCTATTAAI I 1 1 I I I I AAG AT 

TAATTAATTATG 
TGTAT 



exon 1a: nucleotides 1908-1932 
exon 1b: nucleotides 2033-2262 
exon 2: 2655-2720 
exon 3: 2820-2963 
exon 4: 3931-3996 
exon 5: 6539-7137 
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Human TIF cDNA i . • ' ' ' i l 1 I ; 

M ' ,. '/I 1 * -'i. ' ti 1 1 ! , , - ' i 

i i t t i 

, TGCACAAGCAGAATCTTCAGAACAGGTTCTCCTTCCCCAGTCAcbkGTTGCTCC^rTTWnk'*^ ' ' 
, , TCTGCAATGGCCGCCCTGCAGAAATCTGTGAGCTCTTTC^^ " 

■ ^^ CTCTTGGCCCT ^^ , 
, TTGACAAGTCCAACTTCCAGCAGCCCTATATCACCAACCGCACCTT^ 
AGCTTGGCTGATAACAACACAGACGTTCGTCTCATTGGGGAG^ 

GAGTGAGCGCTGCTATCTGATGAAGCAGGTGCTGAACTTCACCCTTGA^ . 
, AATCTGATAGGTTCCAGCCTTATATGCAGGAGGTGGTGCCCTT^^^ 

' ™ g ^ catgtcatattg ^ ggtgatgacctg ^tatccagaS ■ 

'S A ^^ A ^ GCTTGGAGAGAGTGGAGAGATCAAAGCAA TTGG ■ ' 

^ G ^SJ GAGAAATGCCT GCAT^^ ■ 
CCTGCTAGAAATAACAATTAGATGCCCCAAAGCGATTTTT' i iAAU1AA f- c CCCTTTC 
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[ ■ . ' • ii 

u * Human genomic sequence of TIF 

, • . . ■ i i . , ■ • ■ • 

' I TGCACAAGCAGAATCT7CAGX^CAGGTTCTCCTTCCCCAGTCACC\G7T^ , 

AATCTGTGAGCTCTTTCCTT^TGGGGACCCTGGCCACCAGCTGCCtcCTTpCTTGGCCCTCTTGGTACAGGGAGGAGCAGCTGCGC 
CCATCAGCTCCCACTGCAGGCTTGACAAGTCCAACTTCCASCAGCCCT^^ 

acatctcaatcccgctctttcccgttggatctacctggaarccaaacagztctcaaacttttcttcagagca'cctctaagagcttta 
ggaacccaccgctcacccccgagggtagataaattttctgrttttccEgagacwCtttgggaatctggctctttccttttcttgaac 
, ttcttccttccacctcggccrttacgatacatatgatgaatcctccccaaagagcggccattcagtaatccacccgatgattttttti 
tccctttatgcccccgtgcaztgttctaaactcatgcacacatctgaartccgcttttagtcctcatgatgctgctctggggagacg. 
ggatggggcacatgtccatcracaaactttttttctatttcctcaatgiccagacccctagtcttctctcctcttccagGCTAGCrT 
GGCTGATAACAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTgt a'agctacagttgtgacgaacagggccg 
tgcgccgcccacgggcaccrggggtggtggtgatgatggttcaggrcttatcccttacgaccctttctgttccccttccacctgcag 
ATGAGTGAGCGCTGCTATCTCATGAAGCAGGTGCTGAACTTCACCCTTGA^^^ 

I TATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGrTAAGCACATGTgtaagttcagctCCcagcctatgcccacct 
accccccctcccccccccccacagagaccccctcaccccaactcccccccctcccccctacccccaagccagcaggaagaagtgtct 
tggcagcagtg-taccaggagccacttgggaecatagagtacctgctttcgcc etgactgagecacatct tgagcttatagtggtga 
atggggtctggaacccaaaictacagaagccgcactggtctgccrccgsaaaaaaggcaactcaggttgcgtaagatgagaaaggcg 
ccgggaaaacac^tagctcccgaaatggacccaccgageccaag — cicgagcggaggggatggcatggagagaaaetagaagagaa 
, agtgggaaatgggaaggc* caaagtcggcggtgggtcggcagaccg-rgccccgttgatgccatgggaagccazaaaatcggaggcg 
tgtgaactcgacgccgcvc£*cattcgaaactacgaaaa£aagccccagcggagcgggcccagtaaaaggcccwaggacttactgaa 
gagggctcaatctcca=arcagatgttttatgcacatctcr cgt cci aagcatgcaacc tecCggagatacgattgaggttttattc 
cttacagaatctgcataaacraccccgctcctcccacaaargcaaaccrcagcaggatttcccaaagatgaagagaggtcccccgca 
agggaagtgactggactcrg=cgcccaagggaatccaagagctcag=aaatctaggccaccgttgaaatccaggccattgtgggcaa 
aattactaagagcttcaar r rcagg^gaactgtaccgtacc-ccat ccgtgtcgaggctcataaagct tcagcacaacattaagata 
gttatgcttctcattc-tccai agCATATTG AAGGTG ATG ACCTG Z ATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAA 
AAGgcaggactgacaactc-caatgccaagtcatgcaataggagaracaaaccccgt ttttctttcct tcctttctccccatcactt 
tgtgatct ttcacctgaztcicctaccaccagcgcgattactt tcg-gzctgtccatgtagacatatctatacacctagatgccagt 
Ctccaaatcttgcaaatcg-agaattccagaaccggctgccaccccagcccgcctagccacacaacctcagattctggggatggtca 
gcggcagagatagggctagaa.gcaggccccctgaatcccaagccag cat etc tcccggtggcgacacagac cage tttggcaccat 
CaactcC tagggaaatttcaractcctatcgacccatgtaaccccaagaagtacetgeteaaaaacagaaaaacgcctatgggcaaa 
tccacccgaagtcatttctgaagtcaccaatgcaccgccccgaaazitggaagaacaaacccagaacaatgagaaaagagccggact 
tgcacacagggccaaccwctggagtaataaacacccatcctgaattatcataacatctaccagataccgaccatagcctaaaagcaa 
gageagacaaccccgacccc" t cacacaggt tcaaatasagtaaaaataccagcaagagatccattatagttaaatggaagtctga 
actggCaagcctctcccccrzcctcccccccaccaagacccccca, .ctagccccctccttcactcccccaacaaaCccccagggag 
catctatccacggcgggc;c=;gcacatccctatagtgaatgataccaicacgtggcctacctggtgaaaagaacaacaacggaagg 
cttagactaacaatageg* rtcaccccaaaaccggaggaatgatcaggagcagcgaaagtgacgctcttgcaagcaggtacaaccaa 
atactcagaaacacgaagscrccagccgacggaatcttcaccaacaagcccaaccccaatcccccccttttcccccctgacttttta 
aaaaagcgtccctccccca=catcacttaatgagcgtgaccgctccttcccttgacaatcgaaggccttgtagcctcaaattgtgaa 
gcccagccccctcgccatagaaccatcatctagacatggacggcccaargccagcacgccacagacaaggcacgccctacacacctt 
gcctaaaaaactacegaccccatcecgct cgttgccCttacaaaagzcaagtg-gagagaggagaatcccatggcgatctgtgtgat 
ttccaagacctccaacccaccccgaaagaaccaatcccatactcccaacgggtcgccatgtggaagagcgattacgcttttttgctg 
gtagcttcagaaagcacaggagggagagcaatgctgctcagagaaagatcaacaggaggagaaactgtcagagctgtctgaaatagg 
gCggtCCCgggaggcatcaactcccccccgttgggggtaaaagcagaacgcaggttggtagtaaaatgcacgacagacagtagggga 
cgataaacctcaaaactctciacagtcttggagccctcgagacagaaaagaataccttttcggccttacgtcaaaagaagtatggaa 
aggcgaaagggcggaagaaagcaggaaaaggaagaaccatgtatcataragaggacaacggtgacaaggtttcccttgaaataatge 
aaatatgacagaccagaggaatctcagtagggaatgcttcccaccceaacccgggtcccctcttcgattaagttcgggaccctcatc 
tgcatctgacttcgagacagaaagaacgaatgtcaggacccacatccggccccctattaactaaagcaagtggaaaagacttatttg 
gcatttctcccacaaaagtgaaaacttcccttccactgcctgtcaaaaaggtggaaatagaaaaagccttaatgcattggtgaatac 
atggttcaaagccacccgagtagagacgtttcaaatcaggagcgtccaatcatctggcttccctggaccaccttgaaagaattgtct 
cggtacacacacaaaacacaagaacaatagctgacgagccaaaaaagtccatgcacaaatctcatactgctttaagaaagtttatga 
attcctgccagggtgcatccaaagctgtcctgggccatgcgcggcccgrgggctgcaggttggacaagctcctcataagtaatctgt 
catagatagttctggagcccraaaacaggc^eaggcataatgggcggcacccgggatcccccagatcccagccccacttcagtctcc 
ttgctctggttaagaaggggiggtcaaccctctgcccagcccttaaacagccccattagtgcgaggtgcacccgaaattgatgcctg 
ctggcggcctcccagcccagagagccgtcatttcaagctctttggcaaaccatacaatactaaagggatactaccatgaatgtttta 
caaatgcttaaaacccggcttctgcccccaccaacctaatcccgcaattcccaatctgttcactttagaaaacatggcataaatgct 
caaat ac 1 1 C tgcat t cc c at c c tcacagCTTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTC 
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TIFbeta Genomic Sequence C * ^ S^O ) , i?7' ' ,i. 

gaattcaagtccacatgcaatc/^tccgaXtactttgtaaattctcttcttc 
gtataagttattgtaggatcatttaa aaat aatgttt^ 

agagaattagcaaatgtatagtattattttattttaaaaaatctatgctt 

cactactgacatttccaaacttaactt^ . 
aactgtgtgctcacttaccatgctatccgacgagcat^ 

cgctaacaggc tctcc tctcagttatcaacttttgacagttgtgcgatpggtgatgggtgtcctgca 

gaaatctatgagjttttcccttatggggactttggccgccagctgcctgcttctcattgccctgtgg 

gcccaggaggcaaatgcgctgcccatcaacacccggtgcaagcttgaggtgtccaacttccagcag 

ccgtacatcgtcaaccgcacctttatgctggccaaggaggtacagctgcatctctttctctccatac 

cgccttgccatttctctgaagcacttgcaaactcttitaggggcgctttatctccgcag 

cctatgttttctgtctctttagagactctttaaggactggatcttttjctatttcta t 

aggaccatttcctatcttggccttcaggacacatatactgmttttatctacagaggcgcgt^ 

agccacccacgactgcaatactttccatcctgttgtgctctcttctgaactcatactctcttggctac 

tcctgaga ccca ctgcggacatacatctctacttacaggcttttcttccatctccttgtcacccaggc 

actt ag g gttttctctc^ttcaggccagccttg c ag ataac aacac ag acgtccgg ctcatcg g g g a 

GAA 1 

ACTGTTCCGAGGAGTCAGTGTAAGTCCTCACTGTGATGAGCAGGGCTAGCTGCGGGAGCTGGTGGA 

CCCTCTGGGATAGTCTGACGTATGACCCCTGCTGCTTCTTGTCTACCTGCAGGCTAAGGATCAGTGC 

TACCTGATGAAGCAGGTGCTCAACTTCACCCTGGAAGACATTCTGCTCCCCCAGTCAGACAGGTTCC 

GGCCCTACATGCAGGAGGTGGTGCCTTTCCTGACCAAACTCAGCAATCAGCTCAGCTCCTGTGTAA 

GTCTGGCTCTGGCTACCTATGCTCCTCTCTCTTCCTCTTCTATTCCAGTAAGAACCCGAGGTCCTGC 

CCTCTCTCTCTTCACAAGAGTGAGGAGGGCCTCAGCACCACCACCATCATAGGCCACTTGAAATAG 

GTCACAAAGGCTTTGGCTTCAATTGAGTAATACTTTGAGTT^ 

CCATGGAAAGAAATCAACTCAAATTCTGTAGGATGAGAAAGATGTTGGGAACGAAAAAAGGCCTAGA 

TAGAGAAACAGATCTGCTGAGTACAGTACTTATGGGGGGGGGGGGCAGGGGGCGATATCCACTGA 

GTCCAAGTACTTGTTGGGAGAGAAATCCACTGAGTACAAGTACTTGTGGGGGAAGGAATGGCACAG 

AGCAAAAGTTGAAGGGAAAGAGGAAGATGGAGAGGCCTCAATGTTGGGGGTGTGAAAGGTCACTCC 

TTTTTCCATGTGATGGAGAGTTAAGAAAAATCAGTGTGTGAGTTTGATGTCTTCAGACACCCCAACTA 

TGGCAGACTGTGGGAGACCTGGCATTTAGGGAAGGCGCGGCTTTTCACACGAGAAACTTTATGCTC 

ATCTCTTGTGCTACACTCCCACCTTTGATGAGGTTAAGCTCAGGTTTCGTTTCTACCGTTCTTGCTAC 

TGGTGGAAACTTCAGTAGGATTCCCCAAAGACGAGGACAGCTCTTCTGTAAGGGAGGGACCTGGAT 

TTCAGTGTCCTAGAGAACGAAA 

TAGCTCAGAGAATCTAGGTCAACGTGAAATCTAGGTCACAGCGGGCAAAAATGACTGAACGCCTCTA 

TTCCAGGTGAACGGTCACGTGCCTCAGATATACTGAGGTATTGGGCTCCCACCGGATAAGATTCTGT 

TAGTGAGTCTGCTTTTATTTTGCAGCACATCAGTGGTGACGACCAGAACATCCAGAAGAATGTCAGA 

AGGCTGAAGGAGACAGTGAAAAAGGTACTATTGGCAAGCCACAATACTAAGCCATTCAGTAGGAGA 

C G TG G G G ATTTCTTTCTCTG CTTC C C AGTCTCTTCTACTTTGTAAC ATTTTCTTTG ACTTGTCTACTGT 

CTGGTCCATTACTCACTTAGCTGCACCTGCATCTAGCTGGGTCTATAGATCTTTCAATCTGTGTCTAA 

ATTTGTAAGTCACAATTCTGGAGCTAGCAGAAAGCTTAGCTCAGCCAGTCTCATGAGCACTTGCTCG 

GAGGATGGCTTGTGACAGAGTCAATGCTAGAAGACAGCATCCCTGATTCCCAGCTCTGCACTTGCCT 

AGTGGCCACGTGTAATTACTTTAGCCTGATTAAGTATTTGGGAAAGCCAATTCCCACCGACCTACATA 

ATCCGAAGAAGCATGCATTGAAAACTAGAAAGCTGGGCACAAACTTACTAGAGATGATTTTTGAGCT 

CATTAAACTGATGCTCTGAAATGTGATCAAATCAACCCAGAATAACAACAAAAGAGCTGGATTTGCAA 

ATAGGACAAGTATTTAGAATCACTG^GTATTAACAGCTGTCATCTTAATTAAAATATAGTGTCTATTTA 

CTGCCTATTTAAGATTAAACACAAGAGTGGATAACTTCCCAATTTACTGGGCCTGGTTTCAATAGAGT 

AAAAATATCAGTCATAGATTAATTATAGTGTCATGAAAGTATGAGTT^ 

ACCTTCATTTCTTAGTTATTA I i I I I I I I I CTTCACACCCTGATCAAGCCACTAGTAAGCACCTATCTG 
CTGCGA 

GCTATTATATGACTTTACAGCAAACAACATTGCTGTGTGGCCTCTTTGGGGAAGGGAACAGGATAGC 
AGGAGGCTCAGGCTAGCAAGTCTGGACTCAACCTAAAGCCAGAGGCATGGTTGATAGCAGAGAAAG 
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TGAGGCTCTTCACAAGTGGGTGTGCTTAAGTAATCAGAAACA^ 1__ ■ ■ 

«*GTAAGATATCTACCCTTATCTC^^ 

CTGATAAACACGCTTGTTTTC-rnTGAGTGT^CAf ^^m^^^^JSS!^^ 0 ' 

GTTAGAGGGTTTGTTACCTTGACACCTGGGCO^^ 

TGSCTGTGTAAAAGGTTATTATTCATrTACT^ 

TCACAGGAGATGTATTCTCTGTAGGAAAACTTTTTTTTCCC^ 
AACTTTGACTTTTATACCATGCTO^^ 

AAGCACCAATAGGGGMGAAATGTTATGCCGAGAAATCTGACTGGC^^ 

CCCCAAAGACCACTACAGGTGTTAAGTAGGAACAGTCGAGG^ 

GAGGGAGGGAAGATAAGCTACAAAGTTTCATAGGGTCCTAAGT^^^ 

GCGTCACeCTVTTTTACCCTGGATAGGAGGAGGAAAGAAGGGG'TGTTTTnArT^XTA^A^ 

CCTTTAATCCCAGCATTTGGGAGGCAGAGGCAGGTGACmCTA^ 

AAfcGTGAGTTOCAGGACAGCCAGGGCTATACAGAGAAACCCTG^ 
J^^AGAAAAGAA^ 

GGAGAGGAGAGGAGAGGAGAGGAGAAGAGAAGAGAAGAGAAGAGAAGAG/S^^^A^r^r^?^^ 

AAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGMGAGAAGAG^ 

AG AAAAG AAAAGAAAAAAGC AAGC AAG C AAGC ACTG GCAAAG C ATGCCCACA^GG^^r^^T^Y»n 

GTCTTTGAGACAAGGCTTTTGAATTGAGCGCTCATCAATAGTTGATCATG 
CJGTCAGGCCGAGCCCTGCTGGC^^ 

tagc^cagttaggagttgagcaaacctttttttccmcccccactaaaa™ 

TGTAATTTGTGGGATACAGTGTGATAATO^ 

caataggcccatcaacagctttatgggtgtgaaatgcaagtaatataVg^^ 

S^A C ^ 

AAGCGATCGGGGAACTGGACCTGCTGTTTATGTCTCTGAGAAATG^^ 
GCTAGAAAACGAAGAACTGCTCCTTCCTGCCTTCTAAAAAG^ 

^GAAAGAGAAAATAGTGTCAAGTTGTCCATGAGACCAGAGGTAGACTTGATM^ 
TTGACAATArrTTATTGTCATTGATAATGCAACAGAAAAAGTATGTACTTTA^AAAA 
AGGTTACCTCTCATTCCTCTAGAAGAAAAGCCTATGTAACTTCAT^CCA-TAA^ 
GTAAGTTTATTTATTATAAGTATACATTTTATTTATGTC 

TAGGCTTTAATAMCACATGAATATCATAAATOT^ 
^CCATCATCGTCATCATTACCCAATCATTCTCATGACTTCATGCT^ 



exon 1: nucleotides 355-584 
exon2: 976-1041 
exon 3: 1141-1264 
exon 4: 2204-2269 
exon 5: 5225-5824 
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